On Non-Intrusive Workload-Aware Database Replication
Abstract
Performance and high availability are crucial factors in the development of today's distributed database systems. Both challenges are commonly addressed by the same technique: database replication. Throughput increases because computation runs in parallel at different sites, and availability improves because requests are redirected to operational replicas when a replica fails. However, providing transparent database replication is not an easy task. Although database replicas should be as independent of each other as possible for performance and availability reasons, some synchronization is required to provide data consistency. This thesis is about non-intrusive (or middleware) database replication protocols. More specifically, it focuses on the development of practical replication protocols that use off-the-shelf database engines, take advantage of group communication primitives, cope with failures of system components, behave correctly, and achieve high performance by exploiting the specific characteristics of the application. The first part of this thesis addresses the following problem: non-intrusive database replication protocols cannot obtain fine-grained information about transactions because they have limited access to the database engine internals. We make three contributions in this part. The first contribution is the Multiversion Database State Machine, a middleware extension of the Database State Machine, which is a kernel-based replication approach. The Multiversion Database State Machine assumes predefined, parameterized transactions. The particular data items accessed by a transaction depend on the transaction's type and the parameters provided by the application program when the transaction is instantiated. The second contribution of this thesis is a technique to bypass the extraction and propagation of readsets and writesets in non-intrusive replication protocols. 
We present the SQL Inspector, a tool capable of automatically identifying conflicting transactions before their actual execution by partially parsing them. The performance of the Multiversion Database State Machine can be further improved if transactions execute at carefully chosen database sites. Thus, the third contribution of this thesis is
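The idea of identifying conflicting transactions before execution by partially parsing their SQL can be illustrated with a small sketch. This is not the SQL Inspector itself: it assumes table-level granularity, uses deliberately simplified regular expressions, and every name in it (`access_sets`, `may_conflict`, and the example transactions) is hypothetical and chosen only for illustration.

```python
import re

# Simplified, hypothetical patterns: only table names are extracted.
# A real parser would also have to handle subqueries, aliases, and quoting.
_READ = re.compile(r"\b(?:FROM|JOIN)\s+(\w+)", re.IGNORECASE)
_WRITE = re.compile(
    r"\b(?:UPDATE|INSERT\s+INTO|DELETE\s+FROM)\s+(\w+)", re.IGNORECASE
)


def access_sets(statements):
    """Return (tables_read, tables_written) for a list of SQL strings."""
    reads, writes = set(), set()
    for sql in statements:
        writes.update(t.lower() for t in _WRITE.findall(sql))
        # DELETE FROM also matches the read pattern; that is harmless here
        # because a written table already counts toward conflicts.
        reads.update(t.lower() for t in _READ.findall(sql))
    return reads, writes


def may_conflict(txn_a, txn_b):
    """Conservative test: True if one transaction writes a table that the
    other reads or writes. False positives are possible, because two
    transactions may touch different rows of the same table."""
    reads_a, writes_a = access_sets(txn_a)
    reads_b, writes_b = access_sets(txn_b)
    return bool(writes_a & (reads_b | writes_b)) or bool(writes_b & reads_a)


# Hypothetical predefined, parameterized transaction types.
transfer = [
    "UPDATE accounts SET balance = balance - ? WHERE id = ?",
    "UPDATE accounts SET balance = balance + ? WHERE id = ?",
]
report = ["SELECT SUM(balance) FROM accounts"]
audit = ["SELECT * FROM audit_log WHERE day = ?"]

print(may_conflict(transfer, report))
print(may_conflict(report, audit))
```

Because the transactions are predefined, a check like this could in principle run once per pair of transaction types rather than per execution. A finer-grained variant would also inspect the parameters bound at instantiation, since the abstract notes that the accessed data items depend on both the transaction's type and its parameters.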
Similar resources
Scaling transactional workloads on the cloud
In this paper, we address the problem of transparently scaling out transactional (OLTP) workloads on relational databases, to support database-as-a-service in a cloud computing environment. The primary challenges in supporting such workloads include choosing how to partition the data across a large number of machines, minimizing the number of distributed transactions, providing high data availabi...
Dolly: Database Provisioning for the Cloud
The Cloud is an increasingly popular platform for e-commerce applications that can be scaled on demand in a very cost-effective way. Dynamic provisioning is used to autonomously add capacity in multi-tier cloud-based applications that see workload increases. While many solutions exist to provision tiers with little or no state in applications, the database tier remains problematic for dynamic p...
Skyler: Dynamic, Workload-Aware Data Sharding across Multiple Data Centres
Popular online services such as Facebook and Twitter use multiple data centers (DCs) to service globally distributed user requests with the lowest possible latency. Existing distributed data stores either fully replicate or statically shard data across DCs. Full replication limits scalability and static sharding fixes data to a single sharding policy, unable to adapt with user and popularity sh...
Warwick-JLR Driver Monitoring Dataset (DMD): A public Dataset for Driver Monitoring Research
Driving is a safety critical task that requires the full attention of the driver. Despite this, there are many distractions throughout a vehicle that can impose extra workload on the driver, diverting attention from the primary task of driving safely. If a vehicle is aware that the driver is currently under high workload, the vehicle functionality can be changed in order to minimize any further...
Middleware Support for Locality-aware Wide area Replication
Coherent wide-area data caching can improve the scalability and responsiveness of distributed services such as wide-area file access, database and directory services, and content distribution. However, distributed services differ widely in the frequency of read/write sharing, the amount of contention between clients for the same data, and their ability to make tradeoffs between consistency and av...
Publication date: 2009